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(54) System and method for reducing the footprint of preloaded classes 



(57) A method and system that reduces the ROM 
space allocated for internal data structures by a runtime 
engine (such as the Java virtual machine). The internal 
data structures store member information for preloaded 
classes used by applications executed by the runtime 
engine. The system determines the different types of in- 
ternal data structures represented in the classes and 
identifies the possible values of each type's members. 
The system next determines the amount of space re- 
quired to store the values for each type in a respective 



value table and the number of bits needed to index each 
entry of that table. The system determines based on the 
stored information whether occurrences of a member 
are optimally represented as a set of value table indices 
and a value table or, in the conventional manner, as a 
general variable that stores the member's value for each 
occurrence. This determination is based on the size of 
the general variable, the number of occurrences of the 
member, the memory needed for each index and the 
size of the value table. 
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Description 

[00011 The present invention relates generally to a class preloader and, particularly, to a system and method for 
reducing the size in read only memory of preloaded Java classes. 

BACKGROUND OF THE INVENTION 

[0002] A Java program comprises a number ot small software components called classes. Each class contains code 
and data and is defined by information in a respective class file. Each class file is organized according to the same 
platform-independent 'class file format*. Referring to FIG. 1, there is shown a block diagram of the class file format, 
according to which each class file 400 includes header information 402, a constant pool 404, a methods table 406 and 
a fields table 408. The header information 402 identifies the class file format, the size of the constant pool, the number 
of methods in the methods table 406 and the number of fields in the fields table 408. The constant pool 404 is a table 
of structures representing various string constants, class names, field names and other constants that are referred to 
within the class file structure and its sub-structures. The methods table 406 includes one or more method structures, 
each of which gives a complete description of and Java code for a method explicitly declared by the class. The fields 
table 408 includes one or more field structures, each of which gives a complete description of a field declared by the 
class. An example of the fields table 408 is now described in reference to FIG IB. 

[0003] A Java program is executed on a computer containing a program called a virtual machine (VM), which is 
responsible for executing the code in Java classes. It is customary for the classes of a Java program to be loaded as 
late in the program's execution as possible: they are loaded on demand from a network server or from a local file 
system when first referenced during the program's execution. The VM locates and loads each class, parses the class 
file format, allocates internal data structures for its various components, and links it in with other already loaded classes. 
This process makes the method code in the class readily executable by the VM. 

[0004] For small and embedded systems for which facilities required for class loading, such as a network connection, 
a local file system or other permanent storage, are unavailable, it is desirable to preload the classes into read only 
memory (ROM). One preloading scheme is described in U.S. Patent Application Serial No. 08/655,474 ("A Method 
and System for Loading Classes in Read-Only Memory"), which is entirely incorporated herein by reference. In this 
method and system, the VM data structures representing classes, fields and methods in memory are generated offline 
by a class preloader. The preloader output is then linked in a system that includes a VM and placed in read-only 
memory. This eliminates the need for storing class files and doing dynamic class loading. 

[0005] Referring to FIG. 2A, there is shown a more detailed block diagram of the VM data structures 1 200 generated 
by the class preloader. The data structures 1200 include a class block 1202, a plurality of method blocks 1204, a 
plurality of field blocks 1 21 4 and a constant pool 1224. 

[0006] The class block 1202 is a fixed-size data structure that can include the following information: 

• the class name 1 230; 

• a pointer 1 232 to the class block of the current class's immediate superclass; 

• a pointer 1 234 to the method blocks 1 204; 

• a pointer 1 236 to the field blocks 121 4; and 

• a pointer 1 238 to the class 1 constant pool; 

[0007] The elements of a class block data structure are referred to herein as class block members. 
[0008] A method block 1 204 is a fixed-sized data structure that represents one of the class's methods. The elements 
of a method block data structure are referred to herein as method block members. A field block 1214 is a fixed-size 
data structure that represents one of the class's instance variables. The elements of a field block data structure are 
referred to herein as field block members. 

[0009] Each type of VM data structure, including the class block 1202, method blocks 1204, field blocks 1214 and 
constant pool 1 224, has a format defined by a corresponding data structure declaration. For example, a single method 
block declaration defines the format of all method blocks 1204. The data structure declarations also define accessor 
functions (or macros) that are used by the VM to access data structure members. These data structure declarations 
are internal to the VM and are not used by class components. The prior art data structure declarations are now described 
in reference to FIG. 2B. 

[001 0] Referring to FIG. 2B, there is shown a depiction of data structure declarations 1 230 that define the format of 
all data structure types employed by a particular VM. Each declaration 1230 includes a set of member declarations 
1232 and accessor functions 1234 for accessing respective members. The member declarations 1232 and accessor 
functions 1234 are defined conventionally, according to the syntax of the language used in the implementation of the 
VM. For example, assuming the C language is used in the data structure declarations 1 230, a generic field data structure 
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1230.N (shown in FIG. 2B) could be defined as a structure T with five members of the following types with respective 
accessor functions: 



member name member type accessor functions 


memberl 
member2 
member3 
member4 
members 


mtypel 
mtype2 
mtype3 
mtype4 
mtypeS 


meml of (T) T->member1 
mem2 of (T) T->member2 
mem3 of (T) T->member3 
mem4 of (T) T->member4 
memS of (T) T-> members 



[0011] In this example, the member types can be any type defined by the relevant computer language, including 
user defined types or language types, such, as integer, float, char or double. The accessor functions are macros used 
by the VM to access the fields without needing to access directly the structure containing the field. For example, instead 
of employing the expression "T->member1 " to access fieldl in structure type T, the VM need only employ the expression 
"meml of (T)'. Accessor functions are well known in programming languages : such as C, that provide sophisticated 
data structure capabilities. 

[0012] The internal data structures used to store "class meta data" (i.e., the class, method and field blocks 1202, 
1204, 1214) require large, fixed amounts of space in read-only memory. In fact, measurements indicate that this sort 
of class meta data often takes up much more space than the bytecodes for the class methods themselves. These 
internal data structures are therefore often unsuitable for use in small, resource-constrained devices in which class 
preloading is desirable and/or necessary. 

[001 3] Moreover, if the internal data structures were individually modified to save memory space, the VM code would 
need to be extensively revised to enable the VM to correctly access the modified data structures. To make such changes 
to the VM could be onerous and inefficient. - 
[0014] Therefore, there is need for a modified representation of the internal data structures that is smaller in size 
than the prior art data structures, includes all information required by the VM, and does not require extensive or onerous 
modification of the VM code. 



30 SUMMARY OF THE INVENTION 



[0015] In summary, the present invention provides a method and system that reduces the ROM space required for 
preloaded Java classes. 

[0016] In particular, the method and system of the present invention are based upon the realization that, in an envi- 
ronment where the Java VM classes are preloaded, it is highly likely that the VM would be a closed system with a set 
number of classes and class components, such as fields and methods. Such a closed VM would include a fixed number 
of internal data structures, such as class blocks, method blocks and field blocks. Moreover, each member of these 
data structures (e.g., a method block or field block member) would have one of a well-known set of distinct values. 
[0017] Given this assumption and its implications, embodiments of the present invention reduce the memory space 
required to represent the internal data structures by: 

1) determining distinct values of each type of data structure member; 

2) determining occurrences of each data structure member type (e.g., each occurrence in the method blocks of a 
field block member type) and each occurrence's value; 

3) determining memory space that would be saved if each occurrence were represented as an index to a table of 
values of the data structure member type rather than conventionally (storing the value for each occurrence in a 
general variable); and 

4) if sufficient savings would result, allocating a value table containing the distinct data structure member type 
values and configuring each occurrence of that field block member type as an index to the appropriate value table 
entry; and 

5) generating new sources to the VM so that its access to the modified structures is adapted automatically. 



[0018] In a preferred embodiment, the decision is made to represent a data structure member type as a value table 
index plus a value table if the following comparison is true: 

(#occurrences of type) x (size of index) + (size of value table) < 

(#occurrences of type) x (size of general variable). 
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[001 9] Once the present method has determined for each data structure member type whether an occurrence of that 
type is to be represented as an index into a value table or as a general variable storing the value, the present method 

s emits appropriate information for that type, including accessor functions, language declarations and source code that 
initializes the value tables. The accessor functions are macros through which all runtime access to the data structure 
members is accomplished by the VM. Preferably, prior to emitting the above-described information, the present method 
determines the most compact arrangement of the value table indices, conventional representations of members and 
value tables and generates the value tables, value table indices, accessor functions and classes accordingly. 

w [0020] The present method emits accessor functions, declarations and other data structure information after deter- 
mining whether to modify the conventional representation of the data structure members. As a result, all emitted data 
structure information is consistent with changes in the internal class representation. This automatic generation of con- 
sistent data structure information minimizes changes to the VM that are required whenever new classes are added to 
the VM, ancl whenever class representations change. This provides a significant improvement over the prior art. 

is [0021] Embodiments of the system of the present invention include a collection of class files, a Java class preloader 
in which the above method is implemented and output files generated by the preloader, including preloaded classes, 
header files and source code files. 

[0022] The class files define the complete set of classes to be preloaded. The preloader performs a first pass on the 
class files to determine the: different types of members of the internal data structures, 



[0023] The preloader then performs a second pass on the class files and the internal data structures to determine 
how each member is to be represented, conventionally or as an index to a value table entry, and then emits the ap- 
propriate output files. 

[0024] The output files are compatible with similar files employed by conventional Java systems. That is, the pre- 
30 loaded classes can be assembled or compiled into class object data and the header files and source files can be 
compiled with VM sources into VM object data. The VM and class object data can then be linked in the conventional 
manner into the executable VM for a particular Java environment. 

BRIEF DESCRIPTION OF THE DRAWINGS 

35 - 

[0025] Additional objects and features of embodiments of the invention will be more readily apparent from the fol- 
lowing detailed description and appended claims when taken in conjunction with the drawings, in which: 



20 



distinct values of each type of member, 
amount of space required to store the values, 
the size of the value indices, and 
the number of occurrences of each member type. 



25 



FIG. 1 illustrates the class file format common to the prior art and embodiments of the present invention; 



40 



FIG. 2A is a block diagram of VM internal data structures used in the prior art to encode class, method and field 
information; 

FIG. 2B illustrates the data structure declarations that define the format of the VM internal data structures shown 
in FIG. 2A; 



45 



FIG. 3 is a block diagram of a distributed computer system in which the class preloader system and method of the 
present invention can be embodied. 



so 



FIG. 4 is a block diagram of a execution engine in the distributed computer system of FIG.1 in which the preloaded 
classes generated by the class preloader of FIG. 3 are loaded into ROM; 



FIG. 5 is a flow diagram illustrating the processing components used to produce the preloaded executable module; 



55 



FIG. 6 is a flow diagram illustrating the processing components used to reduce the memory footprint of the preload- 
ed executable module; 



FIG. 7 A illustrates the organization of the updated header file 614 of FIG. 6; 



4 



BNSDOCID: <EP 0943989A2_L> 



C C 



20 



25 



30 



35 



40 



45 



SO 



SS 



EP 0 943 989 A2 

FIG. 7B illustrates the organization of the value table 616 ot FIG. 6; 

FIG 8 A Wustrates the organization of same member occurrences and values after allocation in the execution 
engine ROM 208 in accordance with embodiments of the present invention; execution 

20B in ^1^^ ^ OCCUrr6nCeS a,,9r al,OCa,i ° n h - °"f • ™ 

- fmentfof SS^S° , ^ ,0n °' 3 "** StrUCtUre ^ ^ fiVe b * 

FIG. 9B illustrates the organization of the data structure instance from FIG. 9A generated by the prior art; 

preload clashes; and°' *" * *" * bUM th ° intemal s,ructures used in tne 

!!5J 1 18 3 blOCk t !i. a9r w m Sh ° Win9 the mapping ° f a P feloaded application into read-only memory and random- 
access memory and ,nd.cating the loading of the portion of the methods and data mapped into randomness 
memory by a static class initializer. >««iaom-access 



DESCRIPTION OF THE PREFERRED EMBODIMENT 

^!™^ and ^^ 

«M ^VT^ ° P St6ra9e ^ R ° M ° f 3 ^ COmpUt6r < referred to herei " *• execution 

fhl 9 , 2" 21. en ?' ne fe " ke,y t0 be 3 C ° mputer With ,ittle ° f no secondan, storage, it is preferable 

that the Java class preloader be .implemented in a separate computer, which shall be referred to herein as~a server 
Assuming such a configuration, the preloaded classes could be transferred from the server to the execution engine in 
a vanety of ways (e.g. network connection, direct communication link or "sneaker net' transfer of readTble mediT 
such as floppy disks or CDs). Accordingly, a preferred embodiment of the present invention described herein is dlre^te^j 
to a computer system wrth a server and an execution engine wherein the preloaded classes are general by he 

ZZZJXSXZ Ss™ the execution en9ine ,or use in the VM The ~- *™ 

[00271 Referring to FIG. 3. there is shown a distributed computer system 100 in which embodiments of the present 
invention may be implemented. The computer system 100 has one or more execution engines 102 an Pen! o moTe 
fntemet SKE, I* 1' T ^^ent. each execution engine 102 is connect to the serveMoVviaThe 
Internet 106, although other types of communication connections between the computers 102, 104 could be used fe 

C^H^S^T' dir8Ct C H ° mmunication link or " sneaker t^sfer of readable media, such as floppy disks or 
CDs). Preferabfc the server and execut.on engines are desktop computers, such as Sun workstations. IBM compatible 

ZZ ^ ° r MaCintOSh computere - however ' vmuaHy any type of computer can be a sen/er or^xTu ^ 

Zl tZ 6 ' SySt6m 18 T " mi,ed l ° 3 diStnbUt6d COmputer svstem 11 be ^P'emented in vaS 
computer systems and in various configurations, or makes or models of tightly^oupled processors or in various con- 
figurations of loosely-coupled microprocessor systems. """.SOTS or in various con 
[0028] The server computer 104 typically includes one or more processors 112, a communications interface 116 a 
user interface 1 1 4. and memory 1 1 0. The memory 1 1 0 stores: mumcairons (menace 1 1 6, a 

• an operating system 118; 

• an Internet communications manager program or other type of network access procedures 1 20 

• a compiler 1 22 for translating source code written in the Java programming language into a stream of bytecodes 
a source code repository 124 including one or more source code files 126 containing Java source code 

• a class file repository 128 including one or more platform-independent class files 130 and one or more class 
Hbrar.es 1 31 contaming class files, each class file containing the data representing a particular class- 

• tSST ,Z ^ 1 ^ *H 9enera,es a 3et of P retoaded c,asses '<« a particular configuration of the execution 
engine (the class preloader is sometimes referred to as a static class loader) 

• an assembler 1 34 that produces an object file representing the class members, class data structures and memory 
storage indicators in a format that is recognizable for the linker; "emory 

• a linker 1 36 for determining the memory layout for a set of preloaded classes and for resolving all symbolic refer- 

> one or more data files 1 46 for use by the server (including the preloaded classes 1 48). 
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148) can be copied to the execution engine 102. processors 202. a communications 

program option. In .he P' ef «"^^™*™"^ ( ^ ^es" *K« P^"^ '"C'^e an op.ra.ing sy 

the preloaded classes 232. 

[0031] The random access memory 210 stores: 

. a second portion of the Java applications 215 and support procedures 216 217 'that contain methods having 

unresolved references and data that is altered during the applicafon's execution, and 
. one or more data files 228 that the execution engine may utilize during rts process.ng. 

mmmwmmm 

£££ £^d~ r^STJJL rep-sen*,*, o. .he mm. <*» .200 coding ft. 

(S^nte^ened errtoodimen, a member o. *. interna, da« can b. rep,« e n,«. In one C «» w a>S : 



6 



10 



c c 

EP 0 943 989 A2 

1) as a generic memory word (e.g.. of 32 bits) that is the value of the member or 

2) as an index to a table of distinct values that can be taken by the member for each occurrence of the member. 

[0039] The first representation is the only representation used in the data structures emitted by the prior art class 
preloader. Th.s representation can be very inefficient when a particular member tor which hundreds or thousands of 
occurrences exist only has a few distinct values. In such a situation, a full width memory word (e.g.. 32 bits wide) is 
allocated for each of the occurrences, taking up as many as thousands of words of scarce storage in the ROM 208 

IZSSTf 3 k W diff !!. ent K a ' U8S ^ S, ° red ThS S6COnd ^P^entatton. which is employed by theprese^i 
embod ment. solves this problem by generating a value table 616 to hold the definite values of such a member and 
generating for each occurrence of the member an index of only as many bits as is necessary to address all of the value 
table entnes^The second representation is advantageous when the memory that would be allocated for the indices 

SETS l^l P !l ,C K ar K member tyPS iS 8mal,er tha " the altocated memof y ^ uired *» *• gen^c represen 
L^e™ M ™" X dete - ineS h ° W to «~ - ^structures' is de- 

16 ?T th9 determ |" a,ion of how to ^present the members is made, the class preloader 1 32 outputs for each 

^Z^LT °1 " index+tab,e ,ormat "P* 3 "* formation 614 (including modified member 

value table 6 6. The header .nformafon 61 4 and value tables 616, which are generated as source code, are compiled 
by me compter 122 along with the virtual machine sources 618 that define the virtual machine to be executedTthe 
e^onewnelOS. The.,nker136 links the resulting object data 620 arKl the c^jectoata 622 to generate the prek^d 

ZSZttSfS?* ' " h ' Ch , bS ^ int ° ,h ° 9XeCUti0n en9ine 102 ° ne b ™«*»« * «he ^esent^m- 
bod.men is that, whenever new classes or members are to be incorporated in the preloaded classes 1 48. a new VM 

246 must be generated. Th,s is because the corresponding header information 614 and value tables 616 must be 

compiled w,* the VM sources 618. However, because the present embodiment automatically generates the headed 

to the VM code. This .s because the VM 246 always makes use of the accessor functions that are part of the header 

^° n f 61 J I ' PrSSent embodiment is ab,e to ^oeratB an efficient representation of date structuremem- 
bers while facilitating generation of the VM. 

30 ^L^T^T Pre,0ader 1 32 ? ab,e to A 9 enera,e ,he efficient fdexAable member representation because all pos- 
30 Sl ble values of the members are known. As a result, the number of bits needed for each index is also known The 
number of occurrences of each members is also known. Moreover, the preferred embodiment presumes that the class 
files 1 28 represent the complete set of classes that are to be preloaded into the target execution eng ne 102 This 

,oTr p tnr,L esp r ,a,,y appiicab ; e to execution en9ines 102 ,hat are »■"*•« ^^ s , jzjzz 

maZr ^JJXJ? T a ^ r ~ mmunications b ^wi°1h to download classes on the fly in the conventional 
™U , » number ° f ,nd,ces values are ^°wn and that there is no possibility of adding additional 
members or classes, rt is possible for the class preloader 132 to arrange the indices to^ave an optimally compact o 
near-optimal arrangement when allocated by the execution engine 102. The class preloader 132 achieves thte level 
of compaction by selecting the order of the indices in the updated header information 614 

t [ r^L e ?i?«fi 9 iR° ^ 7 t 8nd 7 * th6re iS '" UStrated the or 9an^tion of the updated header information 614 and 
ZT* 1 al ™ 9 * ,1h *P ecrf,c exam P |e * of each data structure. These examples represent the outputs gen- 
V Preloader 132 corresponding to the data structure declaration 1230 N from FIG 2B 

[0043] The updated header file 614 shown in FIG. 7A includes a set of data structure declarations 702, each of which 
can include, in any combination, updated member declarations 704 and un-updated member declarations 706 Each 
data structure declarat.on 702 corresponds to one of the data structures used by the VM 246. The updated member 
declarations 704 are for data structure members that have been modified by the class preloader 13? as indTxWe 
members and the un-updated members 706 are for data structure members that the class preloader 1 32 determined 
^T!^ 9ene / iCal * Each structure declaration 702 is associated with updated member table dec- 
larations 708 updated member accessor functions 71 0 and un-updated member accessor functions 712. Each updated 
member table decfarahon 708 is associated with a corresponding value table 616 and declares that table in the ap- 
propriate Programm '"9 language. An upclated member accessorfunction 710def,n B stheaccessorfunction for urxteted 
(i.e., index/table) members using the table name defined in the respective updated member table declaration 708 The 
un-updated member accessor functions 712 are unchanged from those generated by the conventional class preloader 
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[0044] For example, FIG. 7A shows the updated header file information 614 for the data structure 1230.N (Struct T) 
from FIG. 2B. This example assumes that the class preloader 1 32 determined that: 

1) member! has 400 values and is best represented as an index/table member, 

2) member2 is best represented conventionally, 
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3) member3 has 200 values and is best represented as an index/table member, 

4) m&mber4 has 1500 values and is best represented as an index/table member, and 

5) members is best represented conventionally. 

5 [0045] Consequently, the class preloader 1 32 has generated a modified "struct T" declaration 704 wherein memberl 
is represented as a 9-bit integer index m1_idx (9-bits being enough to access 200 values), members is represented 
as an 8-bit integer index m3_idx (enough to access 400 values) and member4 is represented as an 1 1 -bit integer index 
m4_idx (enough to access 1500 values). The other members, member2 and member5 f are left unmodified as generic 
members of type mtype2 and mtypeS, respectively. 

io [0046] The class preloader 1 32 has also generated an updated member table declaration 708 for memberl showing 
that the memberl values are stored in a value table (memberl _value[]) of type memberl. The member1_value table 
is declared as an external variable (extern), which tells the compiler 122 that the actual values of the table are defined 
in another file, in this case the value tables file 616. Similar updated member table declarations 708 are generated for 
member3 and member*. 

is [0047] The accessor function 710 for the updated memberl is correspondingly modified so that each time the cor- 
responding accessor function, memberl of (T), is invoked the VM 246 that accesses the preloaded methods uses the 
memberl value (i.e., the 9-bit mljdx) as an index into the member1_vatue table. The accessor functions 710 for the 
updated members and member 4 are modified in similar fashion. 

[0048] Referring to FIG. 7B, there is shown a representation of the value tables 616, including a table 722.1 that 
20 defines the definite values that can be taken by the memberl _vafue table declared in the header file 61 4. In this case, 
the member1_value table is defined as a constant array ("const mtypel member1_value[]") consisting of 400 values, 
val 1 . . val 400. Similar representations of the value tables for member3and member4 are also provided (e.g., in the 
member 3 and 4 tables 722.3, 722.4). 

[0049] Referring to FIG. 8A, there is shown an illustration of the manner in which the internal data structures(specif- 
25 icaily, the member occurrences 802 and value table 806 for a single member type) of the present embodiment are 
organized in the execution engine ROM 208. Each of the occurrences represents one occurrence in a preloaded class 
of the same member 802 and the data structure type 805 that encompasses it (the data structure type 805 is likely to 
include multiple members - e.g., see FIG. 2B). Assuming that a particular member has N distinct values 808, which 
are stored in the value table 806, each of the M occurrences 802 of that member is allocated as an index 804 of width 
30 (\ k>g^N)\ + 1) bits to the entry of the value table 806 that holds the member's value. For example, each of the occurrences 
802.1 and 802.6 is an index to the table entry 806.N. This entry 806.N stores the definite value 808.N associated with 
those member occurrences. Thus, the total memory usage of this model is M*(\log<>(N)\+ 1) + value_table_size bits per 
member. 

[0050] Referring to FIG. 8B, there is shown an illustration of the manner in which the prior art organizes occurrences 
3S 852 in the execution engine ROM 208 of a particular member. Each of the occurrences 852 represents one occurrence 
in a preloaded class of a particular member. Each of the M occurrences 852 of that member is allocated as a full-width 
memory word that stores the value 854 of the member for that occurrence (i.e., each of these occurrences are repre- 
sented in the first format referred to above.) Thus, the total memory usage of this model is M*32 bits (assuming 32-bit 
memory words). As a result, the present embodiment saves memory allocated for a particular member in the data 
40 structures when M^hg^N)/^ 1)+ value_tab/e_size is less than M*memory_word_size (e.g., M*32). As in the example 
of FIG. 8A, the fields 802 are likely to be just one element in a data structure declaration. 

[0051] Referring to FIG. 9A, there is shown an example of how the class preloader 132 of the present embodiment 
efficiently stores in the execution engine ROM 208 all of the members 802 of a particular data structure 902 (e.g., the 
members of the structure, Struct T 1230.N, FIG. 2B). Generally, the present embodiment packs the stored values (i. 

45 e., the indices 804) so that they occupy as much of a fixed length memory word as possible. In the illustrated situation, 
the memory words are 32 bits wide, but the present embodiment is applicable to memory words of any length. In the 
example shown in FIG. 9A, the 9-bit, 8-bit and 11 -bit members ml, m3 and m4 from Struct T 902 are packed into a 
single 32-bit memory word. Values of the members m2, m5, which are represented conventionally (e.g., as 32-bit 
values), are stored in respective 32-bit general variables following the first word. In the preferred embodiment, these 

so conventionally-represented members must be aligned on word boundaries (e.g., every 32 bits). There is no such re- 
quirement for the modified members. Therefore, for each data structure instance there are only 4-bits of unused space 
904 between the fourth member m4 and the first general variable m2. The class preloader 132 aims to pack the mem- 
bers of an internal data structure into memory words as efficiently as possible given any combination of member rep- 
resentations and member sizes. 

55 [0052] Referring to FIG. 9B, there is shown a diagram illustrating the format of the same data structure Struct T as 
stored by the prior art class preloader. Note that, in this system, the data structure requires 5 words to store the 5 
members. Thus, the pnor art is far less efficient than embodiments of the present invention (which only need 3 words 
to store the same data structure information). Embodiments of the method of the present invention are now descnbed 
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in reference to FIG. 1 0. 

[0053] This arrangement presents noproblems to the preloaded classes' use of the accessor functions as the different 
memory locations of the indices 804 are resolved by the compiler 122 and the indices themselves store the index of 
their associated value 808. 

[0054] Referring to FIG. 1 0. there is shown a flow diagram of an embodiment of the method of the present invention 
implemented in the class preloader 1 32. The present method is implemented in two passes, which include an account- 
ing pass (represented by the box labeled 1104) and a data structure declaration generation pass (represented by the 
rest of the steps). As the first accounting step (performed for all internal data structures), the preloader 132 identifies 
all member types of an internal data structure (1106). For example, referring to FIG. 2B, the five members of Struct T 
are that data structure's member types. For each member type, the class preloader 132 then performs the followina 
processing: a 

Identify M occurrences of the member type (1108). 
Identify N values of the M occurrences (1110). 
Determine the memory space needed to store each value (1112). 

Determine the memory space needed to store an index that can address the N values (the index must be at least 
\log2(N)\+ 1 bits) (1114); 

Determine the size of the conventional representation of the member occurrences (1116). y 

[0055] This processing is performed on all members of all internal dala structures before proceeding with the steps 
starting with box 1 1 1 8. This order of processing is preferable as the accounting statistics generated by the procedures 
in the box 1104 are used by the subsequent second pass steps. Typically, the accounting statistics are stored tempo- 
rarily for use in the second pass. ^ 
[0056] Once all of the statistics have been generated, the class preloader 1 32 computes for each member type: 

the memory space (LHS) required by the conventional representation of the member occurrences (1120)* and 
the memory space (RHS) required by the novel representation of each member occurrence as an index to a value 
table (1122). 

30 [0057] The class preloader 1 32 computes the LHS value in step 11 20 as follows: 

LHS = (size of the conventional representation) x no. of occurrences 
(size of the conventional representation) x M bits. 
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[0058] The class preloader 1 32 computes the RHS value in step 1 122 as follows: 

RHS = (size of the member value) x no. of occurrences + size of the value table 
(size of the member value) xM+Mx (|log 2 (N)|+ 1 ) bits. 

[0059] If the RHS is smaller than the LHS (1 1 24-Y), the class preloader 1 32 represents that member type as a value 
table and indices { 1 1 26). If the RHS is not smaller than the LHS (1 1 24-N), the class preloader will represent that member 
type conventionally ( 1 1 28). 

so p^LedO^ StGPS 1120 ' 1122 ' 1124 ' 1126 ' 1128 Wh " 6 ° thef members remain to b€ > 

[0061] Once each member has been processed (1124-Y), the class preloader 1 32 performs a data structure decla- 
ration generation procedure 1130. In this procedure, for each data structure the class preloader 132 determines the 
optimal ordering of the data structure members (1 1 32). The ordering process and its considerations have already been 
described in reference to FIG. g A . The class preloader 132 then generates the member header information 614 and 
values table 616 in accordance with the optimal ordering (1134). The generation of the header information 614 and 
member values 616 has already been described in reference to FIGS. 7 A and 7B. 

[0062] Referring to Fig. 1 1 , the preloaded executable module and boot time initiator 1 320 are permanently stored in 
the read-only memory of an execution engine computer Each time the execution engine computer is powered on or 
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rebooted the boot time initiator 1 320 is automatically executed. Among other tasks, the boot time initiator copies all 
methods and data that must be resident in random access memory during execution to the random access memory 
locations assigned to them by the linker. 

r0063] Although the method and system described herein have been described with reference to the Java program- 
s ming language the present invention is applicable to computer systems using other object^riented classes that utilize 
dynamic runtime loading of classes. 

r0064] Further embodiments of the present invention are amenable for execution on various types of executable 
mediums other than a memory device such as a random access memory. Other types of executable mediums can be 
used, such as, but not limited to, a computer-readable storage medium, which can be any memory device, compact 
10 disc, or floppy disk. 

10065] The aforementioned system and method have been described with respect to executing a generic Java ap- 
plication and are applicable to any Java application. For example, the embodiments of the present invention could be 
employed to preload classes used by a personal information manager coded in Java intended to run on a handheld 
computer Moreover, the Java application need not be run in a distributed environment, it can run in stand-alone mode 
75 executing in a execution engine or server computer without importing new classes from external systems. 

[0066] While embodiments of the present invention have been described with reference to a few specific embodi- 
ments the description is illustrative of the invention and is not to be construed as limiting the inventon. \fcrious mod- 
ifications may occur to those skilled in the art without departing from the scope of the invention as defined by the 
appended claims. 
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Claims 

1 . A method for reducing memory footprint of preloaded classes to be incorporated into a runtime environment, com- 
25 prising the steps of: 

determining types of data structures represented in one or more class files used to define a plurality of preload- 
ed classes, each of the data structure types including one or more members; 
determining distinct values that can be taken by each of the members; 

storing each of the values for at least a subset of the members selected to reduce the size of corresponding 
internal data structures composing the preloaded classes; 

generating a set of value indices for addressing stored values stored in the storing step; and 
generating accessor functions and member declarations that enable the runtime environment to use the se- 
lected members represented as the stored values and the set of value indices. 

!. A system for reducing the memory footprint of preloaded classes to be incorporated into a runtime environment, 
comprising: 

a set of class files; and 

40 a class preloader configured to generate from the class files a set of preloaded classes and a plurality of 

internal data structure declarations configured to minimize size of the preloaded classes when allocated in 

memory; . 4 

the class preloader being configured to generate the internal data structure declarations so that each of a set 
of selected data structure members are represented as an index to a storage structure holding distinct values 
45 of the selecied data structure member. 

3. A computer program product for use in a computer system for reducing the memory footprint of preloaded classes 
to be incorporated into a runtime environment executing on an execution engine, the computer program product 
including a computer readable storage medium and a computer program mechanism embedded therein, the corn- 
so puter program mechanism comprising: 

a class preloader configured to generate from a set of class files the preloaded classes and a plurality of 
internal data structure declarations configured to minimize size ofthe preloaded classes when allocated in 
memory of the execution engine; 
55 the class preloader being configured to generate the internal data structure declarations so that each of a set 

of selected data structure members are represented as an index to a storage structure holding distinct values 
of the selected data structure member. 
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A runtime environment built from a collection of preloaded classes defined by one or more internal data structure 
types, each including one or more members, the runtime environment comprising: 

a storage structure holding distinct values that can be taken by at least a subset of the members; and 
an index to a storage structure entry holding the distinct value of a respective member of the subset of the 
members serving as each occurrence of the respective member with the distinct value; 
such that the runtime environment determines when necessary the distinct value of the respective.member 
by retrieving contents of the storage structure at a location defined by the index. 

The runtime environment of claim 4, wherein all of the distinct values are known and the index to the storage 
structure entry is implemented using fewest bits able to index all of the distinct values associated with the respective 
member. 

A method for loading an execution engine with preloaded classes to be incorporated into a runtime environment 
to be executed on the execution engine, comprising: downloading into the execution engine the preloaded classes, 
including a plurality of internal data structure declarations composing the preloaded classes configured so that 
each of a set of selected data structure members is represented as an index to a stored distinct value of the 
selected data structure member, the set being selected to minimize size of the preloaded classes when allocated 
in memory of the execution engine. 

The method of claim 6, wherein the preloaded classes are downloaded over the Internet. 

The method of claim 6, further comprising: allocating the preloaded classes in the memory of the execution engine. 

A method for generating and loading into a client preloaded classes to be incorporated into a runtime environment 
to be executed on the client, comprising: 

generating in a server the preloaded classes, including a plurality of internal data structure declarations com- 
posing the preloaded classes configured so that each of a set of selected data structure members is repre- 
sented as an index to a storage structure holding distinct values of the selected data structure member, the 
set being selected to minimize size of the preloaded classes when allocated in memory of the client; and 
downloading into the client the preloaded classes. 

10. A method for reducing memory footprint of preloaded classes to be incorporated into a runtime environment, com- 
35 prising the steps of: 

determining distinct values that can be taken by members of internal data structures composing the preloaded 
classes; 

storing each of the values for at least a subset of the members selected to reduce the size of the internal data 
40 structures; and 

replacing each occurrence of a subset member with an index to the value of that occurrence, enabling the 
value of that occurrence to be retrieved via the index. 

11. A system for reducing the memory footprint of preloaded classes to be incorporated into a runtime environment, 
45 comprising. 

a class preloader configured to generate a set of internal data structure declarations configured to minimize 
size of the preloaded classes when allocated in memory, the internal data structure declarations declaring 
internal data structures composing the preloaded classes; 
50 the class preloader being configured to generate the internal data structure declarations so that each of a set 

of selected data structure members is represented as an index to a storage structure holding distinct values 
of the selected data structure member. 
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